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Abstract 

> ' 

In standard quantum theory, the ideas of information-entropy and of pure 
states are closely linked. States are represented by density matrices p on a 
Hilbert space and the information-entropy — ti(plogp) is minimised on pure 
states (pure states are the vertices of the boundary of the convex set of states). 
The space of decoherence functions in the consistent histories approach to gen- 
eralised quantum theory is also a convex set. However, by showing that every 
decoherence function can be written as a convex combination of two other 
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decoherence functions we demonstrate that there are no 'pure' decoherence 
functions. 

The main content of the paper is a new notion of information-entropy in 

generalised quantum mechanics which is applicable in contexts in which there 

is no a priori notion of time. Information-entropy is defined first on consistent 

sets and then we show that it decreases upon refinement of the consistent set. 

This information-entropy suggests an intrinsic way of giving a consistent set 

selection criterion. 
03.65.Bz, 04.60.-m, 98.80.Hw 
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I. INTRODUCTION 



A particularly attractive feature of the consistent histories programme, as developed by 
Gell-Mann and Hartle following pioneering work by Griffiths || and Omnes 0-13] is 
that it offers a framework for quantum theory in which time potentially plays a subsidiary 
roleQ The central idea of the scheme is that under certain consistency conditions it is 
possible to assign probabilities to generalised histories of a system. In normal quantum 
theory such histories are represented by time-ordered strings of propositions; however the 
scheme allows for much more general histories in which there is no a priori notion of time 
ordering. These generalised histories are expected to play a key role in application of the 
formalism to quantum gravity. 

In the generalised version of the history scheme that we have developed |L8|-[2(| the 
central mathematical ingredients are a set of histories UV (or, more accurately, the set of 
propositions about histories) and an associated set of decoherence functions V, with the pair 
(UV, V) being regarded as the analogue in the history theory of the pair (£, S) in standard 
quantum theory where £ is the lattice of propositions and S is the space of states on C 

In this paper we address two related issues: we investigate the structure of the convex 
set of decoherence functions and we suggest a new definition of information-entropy for 
decoherence functions. The analogues of these ideas in standard quantum theory are simply 
related by the fact that the information-entropy I S in g ie-time = — tr(plogp) is minimised on 
the vertices of the boundary of the convex set of density matrices p; these vertices are the 
'pure' density matrices corresponding to pure states in the Hilbert space. As we shall show, 
although convex, the space of decoherence functions has a very different structure and there 
are no 'pure' decoherence functions. For this and other reasons we need a rather different 
approach to the notion of information-entropy in generalised quantum theory. 



1 For more recent developments in the consistent histories programme by these authors see, for 



example, [15-17]. 
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Several other authors |21|-p4| have considered aspects of information theory in the context 
of the consistent histories approach. In particular, in a very interesting paper that partly 
motivated our work, Hartle JTL]] proposed a definition of information that we describe in 
section 3. We feel however that our alternative definition has certain advantages over that 
given in [J21j| : in particular, it is more straightforward. 



II. THE CONVEX SET OF DECOHERENCE FUNCTIONS 



A. 'Pure' decoherence functions 



In ||18l , |i9H we described how the space UV encodes the generalised quantum temporal logic 



of the propositions. As explained in |18|J19|] , there are compelling reasons for postulating 
that the natural mathematical structure on UV is that of an orthoalgebra [pBf| , with the 
three orthoalgebra operations ©, and < corresponding respectively to the disjoint sum, 
negation, and coarse-graining operations invoked by Gell-Mann and Hartle. One example 
of an orthoalgebra is the lattice of projection operators on a Hibert space. In this case, the 
operation © is defined on disjoint pairs of projectors P, Q with P © Q = P 1 V Q where, as 
usual, P V Q denotes the projector onto the linear span of the subspaces onto which P and 
Q project. In the example of a lattice (which is a special type of orthoalgebra), V is defined 
on all projectors, not only on pairs that are disjoint. 

Throughout this paper we shall be dealing with the case where the orthoalgebra of propo- 
sitions is the space of projectors on a Hilbert space V which, for the sake of simplicity, we 
shall take to be finite dimensional. This Hilbert space may arise from having propositions 
at n time points, in which case V = © n 7i (see below), but it need not do so. A crucial ingre- 
dient in our construction of the information-entropy will be the dimension of a proposition, 
defined to be the dimension of the projector that represents the proposition on V. 

The properties of the decoherence function d : UV x UV — > C are 

1. Hermiticity: d(a,j3) = d(/3,a>)* for all a, /3 £ UV. 
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2. Positivity: d(a, a) > for all a G UV . 

3. Additivity: if a and (3 are disjoint then, for all 7, d(a © /3, 7) = d(a, 7) + d((3,j). 

4. Normalisation: d(l, 1) = 1. 

One important motivation for our framework is the fact that discrete-time histories in 
quantum theory can indeed be given the structure of an orthoalgebra. The key idea is that 
an n-time, homogeneous history proposition (a tl ,at 2 , . . . ,ott n ) can De associated with the 
operator © at 2 © • • • © ctt n which is a genuine projection operator on the n-fold tensor 
product 7i tl © TCt 2 © • • • © 7it n of n-copies of the Hilbert-space H on which the canonical 



theory is defined [18,19 



It may be noted that if d\ and d 2 are decoherence functions then so is 

d {X) := Xdi + (1 - X)d 2 (1) 

where A is a real constant < A < 1. Thus the space of decoherence functions is a convex 
set. 

What we have said so far looks very similar to the situation in standard single-time 
quantum theory where — by the use of Gleason's theorem — a state may be characterised by 
a positive self-adjoint operator with trace 1 ( density matrix) on the Hilbert space. 

The probability, Prob(P; p), that the proposition represented by the projection operator P 
is true if the system is in the state p and a suitable measurement is made is 

Prob(P;p) = tr(Pp). (2) 

The space of density matrices is also convex in the sense that 

P(A) := Api + (1 - X)p2 (3) 

is a state if p\ and p 2 are states and < A < 1. In standard single-time quantum theory a 
state is said to be pure if it cannot be written in the form (RJ) with p\ 7^ p 2 - Pure states play 
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an important role since, in this case, the probabilities (0) cannot be interpreted as arising 
from a stochastic mixture. 

Since a state p is a positive self-adjoint operator, the spectral theorem shows that it can 
be written as 



P 



where < r, < 1 are the eigenvalues of p and Pj are the projectors onto the associated 
eigenspaces. This shows that unless all the are zero except one, the state is certainly 



impure. Furthermore, it can be shown |26[ that pure states are of the form 

P = P, (5) 

where P is a projection operator onto a one- dimensional subspace. 

In generalised history quantum theory, although the set of decoherence functions is con- 
vex, in other respects the situation is quite different from that in standard quantum theory. 
One may attempt to define a pure decoherence function d as one that cannot be written in 
the form 

d = \d x + (1 - X)d 2 (6) 

with d\ g?2. However we now show that there are no such decoherence functions. 

Firstly, let us recall |2(| that we have characterised all decoherence functions in the case 
that UV is the lattice of projectors on a finite dimensional Hilbert space as follows f\. 

Decoherence functions are in one-to-one correspondence with 'decoherence operators 7 X 
on V <S> V according to the rule 

d{a, (3) = tr v ® v (a ® f3X) (7) 
where the decoherence operator X satishes 



Generalisations of this result have been given in 

IS- 
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1. MXM = X 1 " where M(u ® v) := v ® u; 

2. tr V0V (a ® aX) > 0; 

3. trvig)v(X) = 1. 

It should be noted that X need not be a positive operator. Indeed, in |19|] we found 
examples of decoherence functions in standard quantum theory where d(a, a) > d(/3,/3) for 
two histories a and (3 for which a < /3, and we also found decoherence functions and histories 
7 for which d(^y, 7) > 1. 

Now consider an operator on V <g> V of the following form 

Y = i{s 1 ® s 2 - s 2 ® si) (8) 
for any self-adjoint operators s\, s 2 on V. It may be seen that Y satisfies 

1. MY M = Y^ where M is the interchange operator given above; 

2. tr V( g,v(" <8> ay) =0 Va; in particular tr v ^ v (y) = tr V(gl v(l <8> 1 Y) = 0. 

Given any decoherence operator Xd associated with a decoherence function d, let us define 
two new operators X~£ and X^ by 

X+ = X d + y and X~ d =X d -Y. (9) 

Then Xj" and X^ are also decoherence operators, as may easily be checked. 
Now consider the identity 

X d = \{X d + Y) + \{X d — Y) = \x+ + \x- d . (10) 

It is clear that if d + and d~ denote the decoherence functions that are associated with the 
decoherence operators X + and X~ respectively then 

d=-d + + -d- (11) 

and hence d is impure. Thus there are no 'pure' decoherence functions^. 



3 Other aspects of the structure of the space of decoherence functions have been considered in [29] 
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B. Pure decoherence functions with respect to a window 



Whilst there are no pure decoherence functions in general, it is possible to discuss a 
notion of purity of a decoherence function in the context of a fixed consistent set. In general 
we shall refer to an exclusive and exhaustive set of propositions (i.e. a resolution of the 
identity in the orthoalgebra UV) as a window W = {ctj} 0. 

Firstly, we define two decoherence functions, d\ and d 2 to be W- equivalent if 

1. W is a consistent set with respect to both d\ and d 2 (i.e., d\(a^ <x,) = d 2 (a. t , aij) = 
for all ati, atj G W with a« ^ acj)] and 

2. di(a>i, ai) = d 2 (ati, ai) for all oij G W. 

It may readily be checked that this is indeed an equivalence relation on the space T> of all 
decoherence functions. 

Each equivalence class of VT-equivalent decoherence functions may be represented by the 
member d^ whose decoherence operator has the 'canonical form' (this decoherence function 
has been useful in other contexts, see pTf ) 

~ <A d(ai,an) 

X= /,7T- ^ai®ai. (12) 

(avmaif 

We shall shortly be making use of the fact that X is a positive self-adjoint operator on V® V. 

Let us denote the space of ^-equivalence classes by T>w- Then 5 G T>w is said to be 
W-pure if it cannot be written in the form 

S = ASx + (1 - X)5 2 (13) 

with Si 7^ S 2 ; if 5 can be written in the form ( |T3"D with Si ^ S 2 with < A < 1, AgM, we 
say it is W -impure. Note that the sum of equivalence classes on the right hand side of (13) 



4 Note that in [30] the word window is used to describe the Boolean algebra generated by this set 
of propositions, rather than the set of propositions itself. 
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is well-defined since if d\ =w d[ and d 2 =w d' 2 then Xdi + (1 — \)d 2 =w Ad' x + (1 — \)d 2 , 
where =w means P^-equivalent. 

Clearly any 5 that can be represented by a decoherence operator X of the form ([12]) 
with more than one non-zero d(a, a) is impure. On the other hand, consider a decoherence 
operator 

X := — 1 . 9 a®a (14) 
(aima) z 

where a is one of the members of W and suppose that its associated 1^-equivalence class 
of decoherence functions 5 can be decomposed in the form (|1~3|). Now 5i(j3,j3) > and 



0) > for all f3. Also, since 5(/5, /3) = for all (3 e W such that f3 ^ a, we have 

5(/3,/3) = = A5 1 (/3,/3) + (l-A)5 2 (/3,/3) V/? e W such that ^ a (15) 
and therefore 

= 5 1 (fi,P) = 8 2 (P,P) V/3 e W such that /3 ^ a. (16) 

Also Sx(l, 1) = 1 and 62(1, 1) = 1, which implies Si(^2 a . eW a ^ ^2 aj ew a i) = ^ which in turn 
implies Si(a, a) = 1, and similarly 8 2 (a, a) = 1. Thus 5i and 5 2 are both equal to 5 and 
hence S is pure. 



III. INFORMATION-ENTROPY 

We turn now to the question of defining the information-entropy in the context of a 
window and for a given decoherence function. In standard single-time quantum theory the 
information-entropy is given by 

I 8 - t = -tr(plogp) (17) 

where p is the density matrix, and, as mentioned in the Introduction, I s _ t is minimised on 
pure states. 
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What we seek is a notion of information-entropy that can be used in generalised history 
quantum theory. In particular, the definition should be applicable in principle to systems in 
which the concept of time is not fundamental and may emerge only in some coarse-grained 
way. Furthermore, even if the system has a standard notion of time, the information- 
entropy — which encodes the number of bits required to describe the system — may not nec- 
essarily all reside in the initial state. The description of a system in this generalised type 
of quantum theory is given entirely in terms of the set of propositions and the values of the 
decoherence function, so we must construct our measure of information-entropy solely from 
these. 

Firstly, however, we point out that since — as explained above — the decoherence function 
can be described in terms of a decoherence operator X, the most naive approach (without 
physical motivation) might be to try to construct a measure of information-entropy for a 
decoherence function from X. The simplest analogue of (|PTD is 

J, = -tr(XlogX) (18) 

but this is not defined in general since X is neither self-adjoint nor positive. 

However, focussing on the probability distributions derived from d does, in fact, offer 
a way of defining information-entropy. To see this consider a general probability distri- 
bution with M events {ej}^ with probabilities {Prob(ej)}^ 1 . The usual measure of the 
information-entropy of this distribution is 

M 

- ^Prob(ei)logProb(ei). (19) 

i=l 

On the other hand, a given decoherence function produces not one but many probability 
distributions, namely one for each consistent window. A possible start, therefore, might be 
to define the information-entropy in the context of a window W = {a{\ as 

jtriai = _^ Pro b(a i )logProb(a i ) 

i 

- - d(aj, at) log d(aj, a € ) (20) 

i 
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which is indeed now well-defined. However, as noted by Hartle [2iP, Iyy al does not have 
the appropriate properties with respect to refinement of the consistent set. In particu- 
lar, we require that if the consistent set is refined — corresponding to having finer-grained 



propositions — the information-entropy should decrease or stay the same. However fl2"0|) does 
not have this property — indeed, the most coarse grained set W = {0, 1} (where 1 is the pro- 
jector onto the whole Hilbert space) is always consistent and has d(l, 1) = 1 and d(0, 0) = 0, 
so that I^ al = 0. As Hartle f2lj puts it, there is no penalty for asking stupid questions. 



In his very interesting paper ||21|| , Hartle considered this problem and proposed the fol- 



lowing definition of 'space-time information-entropy'. First choose a measure of the missing 
information S(d) in the decoherence function d; for example, one could choose a standard 
class C stan d of consistent sets and then define S(d) as 



S(d) := min 

wee stand 



d(a, a) log d(a,a) 



(21) 



where W is varied over all consistent sets of histories in the standard class. Hartle suggests 
that C s tand might be chosen to be the class of finest grained histories that decohere. 



Having chosen S(d) the next step is to use the Jaynes construction |32| to define the 
missing information in a general set of decoherent histories W. The missing information 
is the maximum of the information content of decoherence functions which reproduce the 
decoherence and probabilities of the set W: 

S{W, d) := max \s(d)} _ , (22) 

d L J d(a,a)=d(a,a) 

where the maximum is taken over all decoherence functions d that reproduce the decoherence 
function for the set of histories W. Finally, the missing information in any class C of 
decoherent sets of histories is defined as 

S(C, d) = min [S(W, d)] . (23) 

We feel that the definition of information-entropy that we shall now develop has a number 
of potential advantages over that given by Hartle. In particular (i) it is fairly simple; (ii) 
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it does not need the use of maximum entropy ideas; (iii) it does not require the choice a 
standard class of consistent sets. We shall also show in the next section, when calculated 
in the case of standard quantum theory (for consistent sets of homogeneous histories) the 
information-entropy for the decoherence function is found to be equal to — tr(plogp), up to 
normalisation, where p is the initial density matrix. 

As a first step towards finding this new definition of information-entropy consider any 
window W = {ttj}™ =1 that is consistent with respect to a given decoherence function d. 
Then, as explained above, the canonical decoherence operator X dt w that reproduces the 
values of d(aii, otj) of d in the window W is 



n 



\—\ d(ai, an 

^—^ ( dim c^r 

1=1 

The crucial observation is that, unlike a general decoherence operator, X d>w is a positive, 
self-adjoint operator on V <8> V, and so one can define the logarithm of X d>w and thereby 
form 

- ti(X dtW logX d>w ) = - dfe, oti) log ( — - * J . (25) 

\(dim ai) 1 J 

This motivates the following definition of the information-entropy for this decoherence func- 
tion and window: 

h,w ■= - 2^ d(oi, a t ) log — — (26) 



i=l 



(dimaj) 2 



While this function has many of the propeties that are desired of a measure of informa- 
tion, as we shall show below, there are persuasive arguments 5 for renormalising this function 
and to define our measure of information as 

Id,w '■= Id,w — ^d,{l,0} 



5 We are extremely grateful to J Hartle and A Kent for reading an earlier draft of this paper and 
bringing these issues to our attention. 
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^^'^ )log ^^)- logdimv2 

= -J2 d(*u «„) log d(a ;' Qi) v , 2 , (27) 
^ (dim a;/ dim V) 2 

where ^,{1,0} is the value of Id,w for the (coarsest) window {1,0}, and V is the Hilbert space 
on which the history propositions are defined. 
The function 

dinoV (28) 
is the relative dimension of the projector ctj. Before we go further to describe properties 
of this measure of information-entropy, let us describe the reasons for this use of relative 
dimension rather than absolute dimension of a proposition. The difference is clearly only 
important if one envisages comparing information-entropy in situations where V changes. 
An important case in point is the history version of ra-time quantum mechanics. In this case, 
V = ® n 7i, where V = ® n 7i is the Hilbert space of the canonical theory. One uses n-times 
to model a situation in which 'nothing happens' in the intermediate times. 
Consider a consistent set of n-time histories 

{a i } = {Pi,Pi..Pl,Pl +1 ,...Pl}. (29) 

Now imagine inserting an additional time, between t r and t r +i, say, but use the unit projector 
at this time. The use of relative dimension ensures that the information-entropy does not 
change when one does this trivial extension to the consistent set, since the dimension of the 
history 



(Pi,pi..pl,i,pl +l ,...pi) (30) 



is dim7i times that of 



a i = (Pl,PZ...Pl,I» +1 ,...P$ (31) 

however the dimension of V in (|30|) is dim TC times that of (|3lD . 
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An additional aspect of the use of relative dimension is that it may help in extending 
our work to infinite dimensions, since it may be possible to use von Neumann's theory of 
dimension functions of type Hi algebras of projectors p3| . 

Returning now to consideration of general properties of (|2?|), we note that the definition 
we have given is close to the simple form (|2D|), however the extra factor (dimaj) -2 is the 



crucial ingredient that results in the thus-defined information-entropy being either constant 
or decreasing when the window is refined. To see this, consider two consistent windows 
W\ = {a, a>i, a 2 ■ ■ ■ , ««} and W% = {/3, 7, ai, oti . . . , a n } where W% is a refinement of Wi 
in the sense that one of the projection operators in Wi, namely a, is split into two with 
a = (3 © 7. Thus 



h,wi ~ h,w 2 = ~d{a, a) log ( ff'°j fl ) + d(/3, 0) log 



+ d(7,7)log(^r). (32) 



For simplicity of notation it will be convenient to define the ratios 

d{l,l) , , dim( 7 ) 
a ■= WJ) " drmM (33) 

and, without loss of generality, we can take < a < 00 and 1 < b < 00. Now d(a, a) = 

d(P,P) + d(7, 7) and dim(a) = dim(/?) + dim(7), and hence 

I d)Wl ~ I d ,w 2 = d((3,P) (a log (p) - (1 + a) log (^^)) ■ ( 34 ) 
It is not too difficult to prove the inequality 

alog (£) - (1 + a) log ( ; + "; ) > for < a < 00 and 1 < b < 00, (35) 
Vtr/ \(l + byj 

which implies that Id,Wi decreases with respect to this special type of refinement. However, 
any refinement of W\ can be reached in a step-wise fashion by repeated refining of one 
projection operator into two, and hence Id,w decreases under any refinement. We note in 
particular that with this definition of Id,w the consistent set W = {1,0} has information- 
entropy and that this is the maximum possible value of the function Id,w] the minimum 
possible value of 
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i=l ^ ' 

for any d and W is zero (which occurs if there is a consistent set all of whose projectors 
are one- dimensional and all of whose probabilities, bar one, are zero) so that the minimum 
value of Id,w is —2 log dim V. 

At this stage we might proceed in several different ways. One possibility is to leave the 
information-entropy defined in this 'localised' form I dt w in which the context W appears 
explicitly. This procedure would be rather natural within the topos-theoretic interpretation 



of the consistent histories formalism that was introduced recently by one of us |]30fl . In this 
case it is appropriate to define Id,w for an U window Wo (i.e., any set of exclusive and 
exhaustive histories, not necessarily one that is ^-consistent) as 

Id,w ■= mm I dW (37) 

W>Wo 

where the minimisation is taken over all coarse-grainings W of Wq that are c?-consistent. We 
note that Wq \— > Id,w is a n order-preserving map from the partially-ordered set of windows 
to the ordered set of real numbers — an essentially 'functorial' property in the language of 



A second possibility is to define the information-entropy of the decoherence function d 
as the minimum over all consistent sets of Id,w, *.e. ; 

I d :=mmI diW . (38) 
w 

As will become clear from the examples below the consistent set (or sets) which minimise 
Id,w seem to be naturally associated with the decoherence operator X. 

Before proceeding to illustrate these ideas with examples drawn from standard n-time 
quantum theory it is worth emphasising that this definition of I d is non-trivial; in particular, 
it is not independent of d (as, a priori, it might have been). To demonstrate this point 
consider the decoherence function associated with the following decoherence operator on the 
space V <g> V with V = C 2 : 
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X 1 := - {a® 13 + 13® a) 



(39) 



where 



a 



\ 00 / 



, 



v 01 / 



(40) 



If P is the most general one-dimensional projection operator 



P = 



a b 



b* 1-a 



(41) 



/ 



with agK, 0<a<l and b E C, \b\ 2 = a(l — a) one may easily calculate that 



rf(P,l-P) = -(a 2 + (l-a) 2 ) 



(42) 



so that there are no one-dimensional consistent sets and the only consistent set is {0, 1} 
which has 1^ = 0. On the other hand, the decoherence function d 2 associated with the 
decoherence operator 



X 2 = a <S> ot 



(43) 



on the same space has Id 2 = —2 log 2 (since in this case the set {a, (3} is consistent and has 
I d2 = -2 log 2). 



IV. EXAMPLES 

A. The history version of standard quantum theory 

The definition we have given for Id is in terms of consistent sets and their associated 
probability distributions and it is interesting to see how it reduces up to normalisation to 
the familiar information-entropy 

7 s _ t = -tr(plogp) (44) 
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of standard quantum theory. In this case the histories are simply projectors at one time 
point and the value of the decoherence function is 

d{P,Q)=\x{PpQ). (45) 

Firstly it should be noted that all exhaustive and exclusive sets are consistent since if Pi 
and P 2 are two orthogonal projectors then 

d(P 1} P 2 ) = tv(P lP P 2 ) = tr(pP 2 P 1 ) = 0. (46) 

Secondly since — as shown in the previous section — the information-entropy is constant or 
decreases upon refinement of the window, it suffices to consider windows in which all the 
projectors are one-dimensional. Thus the information-entropy of the decoherence function 
is the minimum over all one-dimensional resolutions of the identity W = {Pi}f =1 of I^w 
(where N is the dimension of the Hilbert space V; of course, in this case, V is just the 
canonical Hilbert space H) 

Id,w = ~Y1 d ^ P *) lo S rf ( P - P i) " 2 log iV 

i 

= -E tT ( P iP P i) logtr(PpP) - 2 logiV 

i 

= -J]tr(pP,)logtr(pP,) -2 log TV. (47) 

i 

This expression for I d W is independent of the basis in which the traces are evaluated and it 
is convenient to evaluate it in the basis in which the density matrix, p, is diagonal: 

h,w = -E {jl(Pi)nr)j log (j2(Pihr^j - 2 log N (48) 

where the (Pi)jj are the diagonal elements of Pj in this basis and rj are the (possibly repeated) 
eigenvalues of p. 

Now the function f(x) = — xlogx is a concave function and hence satisfies the inequality 

f^ljx^j >J2W^) ( 49 ) 
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> 



where the positive real numbers U satisfy < U < 1 and Yli h — 1 (this is essentially 
Jensen's inequality; see for example f34|). 

We shall now use this inequality to get a lower bound on Id,w- The one- dimensional 
projectors have trace 1, i.e., 

^2(Pi)jj = 1 for each i (50) 

j 

and therefore we can use fl49|) for each i with {Pi)jj playing the role of lj, so that 
hw = "E \J2(Pihr^J log (^(I'h^ - 2 logiV 

E \52( p <h (^ lo g^)) - 21ogiV. (51) 
However, since ^ Pi = 1, in any basis we have 

E( P *)ii = 1 for each ( 52 ) 

i 

thus 

> - (j2( p i)ii fa lo S r ^ - 2 logiV 

= - E fa log r i) ~ 2 log ^ 

j 

= -tr(plogp) -21ogiV. (53) 

We note that, if p is non-degenerate, by choosing the Pj to be the spectral projections of p we 
can indeed attain the bound. Hence, if we define the information-entropy of the decoherence 
function to be the minimum of I^w over all W we find 

J d = -tr(plogp)-21ogJV. (54) 

If p is degenerate, we should choose a resolution of the identity by one-dimensional projectors 
obtained by replacing each n-dimensional spectral projector Q by any set of orthogonal 
projectors which sum to Q; again one finds that Id = — tr(plogp) — 21ogiV. 

Thus the definition of information-entropy that we have given reduces, up to normalisa- 
tion, to the usual one in the case of single-time quantum theory. 



B. n-time quantum theory 



We recall that in standard n-time quantum theory a history is a time-ordered sequence 
of propositions about the system a = (P^ ,P? , . . . ,P£j with t\ < t 2 < ■ • • < t n . As we 



have argued in [18,19], this proposition should be associated with the operator (P f * <g> P t 



■ ■ ■ ® P") on the tensor product Hilbert space V = <8> TCt 2 <S> • • • <8> "% n of n copies of the 
Hilbert space 7i on which the canonical theory is defined. We have called histories such as 
(Pt <S> Pt 2 <8> ■ ■ ■ <8> Pt\ represented by a tensor product of operators on V, homogeneous; 
there are, of course, many projectors on V which are not of this form. In |2Uj we have shown 
how to construct the operator X (an operator on V <S> V) in this case so as to reproduce the 
standard expression for the decoherence function, namely 

d(a^)=tr H (Cip tQ C p ) (55) 

where 

C a = UfatjpiUfa^PlUfah) . . . U(t n ^t n )P r t l U(t n ,t ) (56) 

and U(t, t') = e~*(* - * ^ H is the usual time-evolution operator in the Hilbert space H. of the 
canonical theory. 

We now show that the minimum value of the information-entropy l^yv over a U consistent 
sets of homogeneous histories for standard quantum theory is 

- tr(plogp) - 2 log dim V. (57) 

We suspect that this value is the minimum for any consistent sets (i.e., including inhomo- 
geneous histories) but so far we have only been able to prove this in certain examples (see 
below). 

Firstly we note that by taking the projection operators at each time to be related to 
the spectral projectors of p we can find a consistent set that gives the value (|57D as the 
information-entropy of that set. More precisely, choose the histories to be of the form 
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a = (U(t , tJ-^Ufato)- 1 , U(t , t 2 )- l Pi 2 U{t 2 , t )-\ . . . , 

...,U{t 0l t n )- l PlU{t ni t Q )- 1 ) (58) 

where Pi is a one-dimensional projector onto the zth eigenspace of p. The unitary operators 
are needed to 'undo' the time evolution so that expressions for the probabilities become of 
the form 

tv(P n ...P j P iP P i P j ...P n ). (59) 

It is easy to see that all of the histories so defined will have zero probability except those 
of the form (P,, P iy . . . , Pj), i = 1, 2, . . . , N (where N is the dimension of the Hilbert space) 
which have probabilities 

tv(Pi . . . PiPipPiPi ...Pi)= tr{ P Pi) = r, (60) 

where are the eigenvalues of p. Thus the value of the information-entropy in this window 
is 

N 

Id,w — — r i log^j — 2 log dim V = — tr(plog p) — 2 log dim V. (61) 

i 

In order show that the value of I^w i n an y other window of homogeneous projectors 
is greater than this it is helpful to note the following. Let be a resolution of the 

identity by projectors in the Hilbert space 7i, and let K be a positive self-adjoint operator. 
Then 

-t^^igm) 

> -ti(KlogK), (62) 

and also 

-Etr((QW) log (||^) ) >-tr(/nog/0- (63) 

The inequality ( p2| ) is essentially the same as that proven in the previous subsection 
(it should be noted that that proof did not depend on the fact that p had trace 1). The 
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inequality (|63|) may most easily be seen by considering the left hand side in a basis in which 
the <5 J, s are simultaneously diagonal. Then the operator Q J KQ J is of the block diagonal 
form 



Q j KQi 



(0) 



\ 



(0) 



(0) 



V 



(64) 



(0)J 



where is a dimQ- 7 x dimQ- 7 positive self-adjoint matrix. Clearly 

( ()l ,- nn Q j KQ j 
-tr NxN [ Q J KQ J log 



(.lim^TV " - trdim ^ xdim ^ ^ l0g (dimQi)2 

> -tr d i m Q Jx dimQJ (K 3 \ogK 3 ) . 



(65) 



We may now use these results and one from the previous section to find an upper bound 
on the information-entropy for any consistent window of homogeneous projectors. We have 



l d,W 



tr(Pj n Pj n _ 1 . . . Pi 2 Pi x pPi x Pi^ . . . Pi n l P, h 



Jl,l2,."i»n-ljln 



x log tr 



(j»njjjn-l • • • PiiPj\PPj\Pii • • • ^n-l^w) 

(dim P h dim P i2 . . . dim P in _ a dim P in ) 2 
- (dim P n ... dim P^J 2 



2 log dim V 



n,...,i n _i 
x 



E tr ( p - 

x logtr 



jjn-l • • • PiiPP'h • • • £«n-l 

_ (dimP^ . . .dimPj n _J 2 



P- 



(dimPj 



P„_i • ■ • PiPPi • ■ ■ Pj n -\ 
(dim . . .dimP in _ 1 ) 2 



— 2 log dim V (66) 



and hence (B2|) can be used to show that 



> - 



E 

h,i2,—,in-l 



X tr 



(dimP il ...dimP ift _ 1 ) 2 

Pj n -1 ■ ■ ■ PjlPPjl ■ ■ ■ Pi„-! 

_ (dimP, . . . dim Pi n _J 2 



log 



Pjn-1 • ■ ■ Pj\PPj\ ■ ■ ■ Pj n -1 

_ (dim . . . dim Pi n _ x ) 2 
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—2 log dim V 

E tr([P in _ 1 ...P hP P il ...P in _ 1 ]lot 



ji,i 2 ,...,«„-i 

—2 log dim V. 

We may now use (^) to give 



^n-l • • • PhpPii ■ ■ ■ Pj n -i 
_ (dimP^ . . . dimP^) 2 



(67) 



«l,l2,...,i„_2 



>- trl[P in _ 1 ...P ilP P il ...P in _ 1 ]lo, 

11,12,— ,in-l 

= - (dimP n ...dimP n _ 2 ) 2 

P„_2 • ■ • Pi\PP%\ ■ ■ ■ P n - 



Pjn-1 • • • PilPPil ■ ' ■ £jjg-l 

_ (dimP^ . . . dimP in _ 1 ) 2 



2 log dim V 



(dim P ix . . . dim P ir 



P 



ln-1 



x log • 



p- 



P%\PPi\ • ■ ■ Pi, 



(dimPj n J 2 [ (dimP^ . . . dimp 



l~n-2) 



Pi. 



2 log dim V 



11,12, ...,l„_2 



>- tv([P in _ 2 ...P hP P il ...P in _ 2 ]lo k 



Pj„-2 ■ ■ ■ Pj\PPj\ ■ ■ ■ Pj n -2 

(dimP^ . . . dimP in _ 2 ) 2 



2 log dim V. 

(68) 



The right-hand-side of this expression is of the same form as the right-hand-side of 
but with one less summation (n — 2 summations compared to n — 1 in (0)). We may now 
repeat this step recursively to show that 



h,w > tr (p log p) - 2 log dim V. 



(69) 



Thus the minimum of I^w over consistent windows containing only homogeneous his- 
tories is — tr(plogp) — 2 log dim V. In other words, all the information-entropy lies in the 
initial state for standard n time quantum theory with unitary evolution. 

It is worth noting that if the time evolution is non-unitary (such as might occur in 
a space-time region around a black hole) then histories such as (|58| ) cannot be used to 
minimise the information-entropy since the operators at each time are no longer projectors. 
Thus there will be a contribution to the information-entropy from the time evolution in 
addition to that from the initial state. 
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C. Two-time histories 



As was remarked earlier, we suspect that — tr(plogp) — 2 log dim V is the minimum of 
Id,w over all consistent sets although, so far, we have only been able to show this in certain 
special cases. One such is the two-time history version of a quantum system with canonical 
Hilbert space H = C N (i.e., the history Hilbert space is V = H®H = C n2 ) with a unitary 
time evolution and the special initial density matrix 

P = diag( ^'^'^'---'^ ) - (70) 

In fact we may take the time evolution to be trivial (i.e. we choose the Hamiltonian to be 
zero) without loss of generality, since the class of history propositions we will consider takes 
into account all possible unitary evolutions. 

We now show that the minimum of Id,w over all consistent sets is 

-tr w (plogp) -21ogdimV = log iV — 2 log iV 2 = -31ogiV. (71) 

The most general unit vector in the tensor product space V = H <g> H is 

N 

v=^2v ij \i)®\j) (72) 

where {|«)}£Li is an orthonormal basis for Ti and the constants v 1 ^ satisfy 

J2v ij v* ij = l. (73) 

ij 

The one-dimensional projection operator onto the subspace defined by this vector is 

P V = J2 ® b'» (v* km (k\ ® (m\) 

ijkm 

= J2v ij v* km \i)(k\ ® \j){m\. (74) 

ijkm 

If we consider consistent sets that contain only one-dimensional projectors then in order 
to decrease the information-entropy at least one projector must have a probability greater 
than 1/N. However, we will now show that the maximum value of the probability of any 
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one-dimensional projector in a consistent set is 1/N, so that no consistent windows with one- 
dimensional histories reduces the information-entropy below log N — 2 log iV 2 = —3 log N, 
the value obtained by considering windows with only homogeneous histories. We also show 
that (as might be expected) windows with higher- dimensional histories also fail to reduce 
the information-entropy below this value. 
If P v is part of a consistent set then 

d(l-P v ,P v )=0 (75) 

so that 

d(l, P v ) = d{P v + (1 - P v ), P v ) = d(P v , P v ) + d(l - P v , P v ) = d(P v , P v ) (76) 

where we have used the fact that if a and (3 are two disjoint histories, then for any other 
history 7, 

d(a©/?,7) = d(a,j) + d(0,i). (77) 

Thus the probability of this particular consistent history proposition is 

d(P v ,P v ) = d(l,P v ) 

= tT V9V ([l®P v ]X) (78) 

where X is the decoherence operator for this system which may be found in p(J (as a special 
case of the results given there): 

X = [R {2) ® l 2 ]S4[l2® (p® h)][R(2) ® 1 2 ]. (79) 

In this expression, l± is the unit operator on 7i and I2 is the unit operator on V = 7i ® 7i; 
R(2) is the 'time-reversal' operator on V = TC (8> H: 

R(2)U\ ®u 2 = u 2 ® Ui, Rf 2 ) = 1; (80) 

and 5*4 is the map on which acts as 
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S 4 (Ui <8> «2 ® «3 ® Ua) =U 2 ®U?,®Ua® Mi, (81) 

which has the important property that, for any four operators A, B, C, D on 7i, 

tr®4 H {[A ®B®C®D]S 4 ) = ti H {ABCD). (82) 



Thus, 



d(l,P v ) = tT V ® v ([l 2 ® P v ] X) 

= tr 4 W ([l 2 <g> P v ][R m ® l 2 ]S 4 [h ® (p ® li)][i2(2) ® I2]) 

= tr^4 W ([ J R (2) <8) 1 2 ][1 2 <8) P„p (2) (8) 1 2 ]S 4 [1 2 ® (p ® li)]) 

= tr 4 W ([l 2 <g> P^ 4 [l 2 ® (p ® li)]) 

= tr 8 4 W (5 , 4 [l 2 ®{(p® li)P„}]) 

= *W fcm tr^(S 4 [li ® li ® p|z)(A;| (8) |j)(m|]) 

ijkm 

= J2v ij v* km tr H [p\i)(k\j)(m\] (83) 



ij'fcm 

where we have used (]82f). 



Thus 

P„) = — ^ v V v *km 5 im S kj 
ijkm 

= — V 

= — [ y v u v* u + v v ij v* ji I 

J 



— I J2 v u v* u + (~\ vlj - vJi \ 2 + vijv * ij + v ji v* ji ) 

V i i<j 



< — . (84) 

We also note that this calculation shows that for a k dimensional projector P(k), the proba- 
bility 
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d(P {k) ,P ik] ) = d(P ik) ,l)<- (85) 

so that including higher dimensional projectors in a window will only increase the value of 
the information-entropy. Hence, for this example, — tr^(plogp) — 2 log dim V = — 31ogiV is 
the minimum value of I<i,w over all windows. 



V. CONCLUSION 

We have shown that there are no pure decoherence functions in the consistent histories 
approach to generalised quantum theory since every decoherence function may be written 
as the sum of two others. 

More substantially, we have also put forward a definition of information-entropy in gen- 
eralised quantum mechanics that relies crucially on the notion of the dimension of a history, 
a concept that is natural within our approach to the general scheme. It is worth noting 
that fundamental to the consistent histories approach from the start has been the idea of 
taking the sum of two homogeneous histories in standard n-time quantum theory to form 
inhomogeneous histories. However, the idea of the dimension of an inhomogeneous history 
is difficult to understand unless, as we have frequently advocated, histories are identified 
with projection operators on an n-fold tensor product space. 

We have called the function l d W a measure of information-entropy for generalised quan- 
tum mechanics as it has key properties that it decreases under refinement and it is small 
for consistent windows in which the probability is peaked around histories of small dimen- 
sion (as we have shown, decoherence functions may or may not have such windows). The 
fact that Id,w is negative, however means that it is not quite a usual measure of missing 
information. One does, of course, have the option of using the negative of the function Id,wi 
however we have not done so in order to facilitate comparison with other approaches. 

It ought to be said at this stage that while the function l^w has many of the properties 
that one requires of a measure of information-entropy in the space-time context, its true 
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meaning is still somewhat unclear. In this context it should be noted that any function of 
the form 



T d,w = - ^d(ai,ai) log 



d(aii, on) 



(dimaj/ dimV) 



(86) 



i=i 

where x > 1 is a real number also has the key property that it decreases under refinement 
of the consistent set. The case x = 1 may turn out to be the most interesting, as in this 
case, the measure of information is (minus) the Kullback information |35[] of the distribu- 
tion {d(ai,aii)} relative to a 'maximally ignorant' distribution on the set {aj} which has 
Prob(aj) = dim(aj)/dimV. The relationship between the measures with different values of x 
needs to be understood. Interestingly, Gell-Mann and Hartle |TJj have considered measures 



of this sort as a result of rather different considerations such as thermodynamic depth j36| . 
We understand |37j that they have also considered a measure of entropy which they call a 



'bundle of histories entropy' which takes into account the number of fine-grained histories 
in a coarse-grained history; this idea is clearly related to the one we have put forward. 

We anticipate that our definition of information-entropy — which is a straightforward 
function on the class of consistent sets with attractive properties under refinement — may 
help in the development of a set selection criterion^ for example, in the case that the system 
naturally divides into a subsystem and the 'environment', this might be done by selecting the 
set which minimises the information-entropy of the distinguished subsystem (see for example 
39|). This is an important problem to which we intend to return in future work. Related 



issues that need to be understood are the role of symmetries (see for example [p9| , ^0| , p!| ) and 



the existence of quasi-classical domains and their relation to the system-environment split. 
In the context of the latter, it should be noted that if our vector space V happens to arise as 
the tensor product of two spaces Vi and V2, then our definition of information-entropy has 
precisely the behaviour that might be hoped for. For if one considers a consistent window 



3 The importance of this issue for the whole framework has been discussed by Dowker and Kent 
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in which each history proposition a is a tensor product a = ot\ <8> «2, with ot\ G -P(Vi) and 
«2 £ -^(^2), then the information-entropy is the sum of the information-entropy associated 
to each sub-system. 
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